智能论文笔记

SeLoC-ML: Semantic Low-Code Engineering for Machine Learning Applications in Industrial IoT

Haoyu Ren , Kirill Dorofeev , Darko Anicic , Youssef Hammad , Roland Eckl , Thomas A. Runkler

分类：人工智能 | 机器学习

2022-07-18

物联网（物联网）正在通过弥合信息技术（IT）和运营技术（OT）之间的差距来改变行业。机器正在与连接的传感器集成在一起，并通过智能分析应用程序管理，加速了数字化转型和业务运营。将机器学习（ML）带到工业设备是一个进步，旨在促进IT和OT的融合。但是，在工业物联网（IIOT）中开发ML应用程序提出了各种挑战，包括硬件异质性，ML模型的非标准化表示，设备和ML模型兼容性问题以及慢速应用程序开发。在这一领域的成功部署需要深入了解硬件，算法，软件工具和应用程序。因此，本文介绍了一个名为ML应用程序的名为“语义低代码工程”（SELOC-ML），该框架建立在低代码平台上，以利用语义Web技术来支持IIOT的ML应用程序的快速开发。 SELOC-ML使非专家能够轻松地模拟，发现，重复使用和对接ML模型和设备。可以根据匹配结果自动生成项目代码在硬件上部署。开发人员可以从称为食谱的语义应用模板中受益，从而快速原型最终用户应用程序。与工业ML分类案例研究中的传统方法相比，评估证实了至少三倍的工程努力，显示了SELOC-ML的效率和实用性。我们分享代码并欢迎任何贡献。

translated by 谷歌翻译

Image quality prediction using synthetic and natural codebooks: comparative results

Maxim Koroteev , Kirill Aistov , Valeriy Berezovskiy , Pavel Frolov

分类：计算机视觉

2022-12-20

We investigate a model for image/video quality assessment based on building a set of codevectors representing in a sense some basic properties of images, similar to well-known CORNIA model. We analyze the codebook building method and propose some modifications for it. Also the algorithm is investigated from the point of inference time reduction. Both natural and synthetic images are used for building codebooks and some analysis of synthetic images used for codebooks is provided. It is demonstrated the results on quality assessment may be improves with the use if synthetic images for codebook construction. We also demonstrate regimes of the algorithm in which real time execution on CPU is possible for sufficiently high correlations with mean opinion score (MOS). Various pooling strategies are considered as well as the problem of metric sensitivity to bitrate.

translated by 谷歌翻译

Evaluation of RGB-D SLAM in Large Indoor Environments

Kirill Muravyev , Konstantin Yakovlev

分类：机器人

2022-12-12

Simultaneous localization and mapping (SLAM) is one of the key components of a control system that aims to ensure autonomous navigation of a mobile robot in unknown environments. In a variety of practical cases a robot might need to travel long distances in order to accomplish its mission. This requires long-term work of SLAM methods and building large maps. Consequently the computational burden (including high memory consumption for map storage) becomes a bottleneck. Indeed, state-of-the-art SLAM algorithms include specific techniques and optimizations to tackle this challenge, still their performance in long-term scenarios needs proper assessment. To this end, we perform an empirical evaluation of two widespread state-of-the-art RGB-D SLAM methods, suitable for long-term navigation, i.e. RTAB-Map and Voxgraph. We evaluate them in a large simulated indoor environment, consisting of corridors and halls, while varying the odometer noise for a more realistic setup. We provide both qualitative and quantitative analysis of both methods uncovering their strengths and weaknesses. We find that both methods build a high-quality map with low odometry noise but tend to fail with high odometry noise. Voxgraph has lower relative trajectory estimation error and memory consumption than RTAB-Map, while its absolute error is higher.

translated by 谷歌翻译

DEL-Dock: Molecular Docking-Enabled Modeling of DNA-Encoded Libraries

Kirill Shmilovich , Benson Chen , Theofanis Karaletos , Mohammad M. Sultan

分类：机器学习

2022-11-30

DNA-Encoded Library (DEL) technology has enabled significant advances in hit identification by enabling efficient testing of combinatorially-generated molecular libraries. DEL screens measure protein binding affinity though sequencing reads of molecules tagged with unique DNA-barcodes that survive a series of selection experiments. Computational models have been deployed to learn the latent binding affinities that are correlated to the sequenced count data; however, this correlation is often obfuscated by various sources of noise introduced in its complicated data-generation process. In order to denoise DEL count data and screen for molecules with good binding affinity, computational models require the correct assumptions in their modeling structure to capture the correct signals underlying the data. Recent advances in DEL models have focused on probabilistic formulations of count data, but existing approaches have thus far been limited to only utilizing 2-D molecule-level representations. We introduce a new paradigm, DEL-Dock, that combines ligand-based descriptors with 3-D spatial information from docked protein-ligand complexes. 3-D spatial information allows our model to learn over the actual binding modality rather than using only structured-based information of the ligand. We show that our model is capable of effectively denoising DEL count data to predict molecule enrichment scores that are better correlated with experimental binding affinity measurements compared to prior works. Moreover, by learning over a collection of docked poses we demonstrate that our model, trained only on DEL data, implicitly learns to perform good docking pose selection without requiring external supervision from expensive-to-source protein crystal structures.

translated by 谷歌翻译

Optimizing Stock Option Forecasting with the Assembly of Machine Learning Models and Improved Trading Strategies

Zheng Cao , Raymond Guo , Wenyu Du , Jiayi Gao , Kirill V. Golubnichiy

分类：机器学习

2022-11-29

This paper introduced key aspects of applying Machine Learning (ML) models, improved trading strategies, and the Quasi-Reversibility Method (QRM) to optimize stock option forecasting and trading results. It presented the findings of the follow-up project of the research "Application of Convolutional Neural Networks with Quasi-Reversibility Method Results for Option Forecasting". First, the project included an application of Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks to provide a novel way of predicting stock option trends. Additionally, it examined the dependence of the ML models by evaluating the experimental method of combining multiple ML models to improve prediction results and decision-making. Lastly, two improved trading strategies and simulated investing results were presented. The Binomial Asset Pricing Model with discrete time stochastic process analysis and portfolio hedging was applied and suggested an optimized investment expectation. These results can be utilized in real-life trading strategies to optimize stock option investment results based on historical data.

translated by 谷歌翻译

2.5D Mapping, Pathfinding and Path Following For Navigation Of A Differential Drive Robot In Uneven Terrain

Stepan Dergachev , Kirill Muravyev , Konstantin Yakovlev

分类：机器人

2022-09-15

在机器人研究中，在不平坦的地形中安全导航是一个重要的问题。在本文中，我们提出了一个2.5D导航系统，该系统包括高程图构建，路径规划和本地路径，随后避免了障碍。对于本地路径，我们使用模型预测路径积分（MPPI）控制方法。我们为MPPI提出了新的成本功能，以使其适应高程图和通过不平衡运动。我们在多个合成测试和具有不同类型的障碍物和粗糙表面的模拟环境中评估系统。

translated by 谷歌翻译

Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification

Kirill Prokofiev , Vladislav Sovrasov

分类：计算机视觉

2022-09-14

多标签图像分类允许从给定图像预测一组标签。与多类分类不同，每个图像只有一个标签，此类设置适用于更广泛的应用程序。在这项工作中，我们重新审视了多标签分类的两种流行方法：基于变压器的头和标签关系信息信息图处理分支。尽管基于变压器的头被认为比基于图基的分支更好地取得了更好的结果，但我们认为，使用适当的训练策略，基于图形的方法可以证明精确度的较小，同时将计算资源减少到推理上。在我们的训练策略中，我们在角度空间中引入了其修饰作用，而不是非对称损失（ASL）（ASL），而不是非对称损失（ASL）。与二进制跨熵损失相比，它隐含地学习了每个班级单位超球的代理特征向量，从而提供更好的歧视能力。根据提出的损失和训练策略，我们在单个模态方法中获得SOTA结果，以广泛的多标签分类基准，例如MS-Coco，Pascal-Voc，Nus wide和Visual Genome 500。 OpenVino培训扩展https://github.com/openvinotoolkit/deep-object-reid/tree/tree/multilabel

translated by 谷歌翻译

ScaleFace: Uncertainty-aware Deep Metric Learning

Roman Kail , Kirill Fedyanin , Nikita Muravev , Alexey Zaytsev , Maxim Panov

分类：计算机视觉 | 人工智能 | 机器学习

2022-09-05

现代基于深度学习的系统的性能极大地取决于输入对象的质量。例如，对于模糊或损坏的输入，面部识别质量将较低。但是，在更复杂的情况下，很难预测输入质量对所得准确性的影响。我们提出了一种深度度量学习的方法，该方法允许直接估算不确定性，几乎没有额外的计算成本。开发的\ textit {scaleface}算法使用可训练的比例值，以修改嵌入式空间中的相似性。这些依赖于输入的量表值代表了对识别结果的信心的度量，从而允许估计不确定性。我们提供了有关面部识别任务的全面实验，这些实验表明与其他不确定性感知的面部识别方法相比，比例表面的表现出色。我们还将结果扩展到了文本到图像检索的任务，表明所提出的方法以显着的利润击败了竞争对手。

translated by 谷歌翻译

Application of Convolutional Neural Networks with Quasi-Reversibility Method Results for Option Forecasting

Zheng Cao , Wenyu Du , Kirill V. Golubnichiy

分类：机器学习

2022-08-25

This paper presents a novel way to apply mathematical finance and machine learning (ML) to forecast stock options prices. Following results from the paper Quasi-Reversibility Method and Neural Network Machine Learning to Solution of Black-Scholes Equations (appeared on the AMS Contemporary Mathematics journal), we create and evaluate new empirical mathematical models for the Black-Scholes equation to analyze data for 92,846 companies. We solve the Black-Scholes (BS) equation forwards in time as an ill-posed inverse problem, using the Quasi-Reversibility Method (QRM), to predict option price for the future one day. For each company, we have 13 elements including stock and option daily prices, volatility, minimizer, etc. Because the market is so complicated that there exists no perfect model, we apply ML to train algorithms to make the best prediction. The current stage of research combines QRM with Convolutional Neural Networks (CNN), which learn information across a large number of data points simultaneously. We implement CNN to generate new results by validating and testing on sample market data. We test different ways of applying CNN and compare our CNN models with previous models to see if achieving a higher profit rate is possible.

translated by 谷歌翻译

Comprehensive Dataset of Face Manipulations for Development and Evaluation of Forensic Tools

Brian DeCann , Kirill Trapeznikov

分类：计算机视觉

2022-08-24

数字媒体（例如，照片，视频）可以轻松创建，编辑和共享。编辑数字媒体的工具能够这样做，同时还可以保持高度的照片现实主义。尽管数字媒体的许多类型的编辑通常是良性的，但其他类型的编辑也可以用于恶意目的。例如，最先进的面部编辑工具和软件可以人为地使一个人在不合时宜的时间里微笑，或者将权威人物描绘成虚弱和疲倦的人以抹黑个人。鉴于越来越容易编辑数字媒体和滥用的潜在风险，因此媒体取证已经大大努力。为此，我们创建了一个编辑的面部图像的挑战数据集，以帮助研究社区开发新的方法来解决和对数字媒体的真实性进行分类。我们的数据集包括应用于受控的，肖像式的额叶脸部图像和全场景图像的编辑，这些图像可能包括每个图像的多个（即多个）脸部。我们数据集的目标是解决以下挑战问题：（1）我们可以确定给定图像的真实性（编辑检测）吗？（2）如果已经编辑了图像，我们可以\ textit {entimize}编辑区域吗？（3）如果已经编辑了图像，我们可以推论（分类）执行哪种编辑类型？图像取证中的大多数研究通常试图回答项目（1），检测。据我们所知，没有专门策划的正式数据集分别评估项目（2）和（3），本地化和分类。我们的希望是，我们准备好的评估协议将帮助研究人员改善与这些挑战有关的图像取证中最新的。

translated by 谷歌翻译

HTML版本